Integrating Verbal Idioms into an NLP System
Abstract
This paper describes the integration of verbal idioms into a Natural Language Processing (NLP) system. It adopts a construction approach based on the prior parsing stage, so that these MultiWord Expressions (MWEs) can be taken into account in subsequent tasks, such as semantic role labeling or whole-part relation extraction. The paper focuses on body-part nouns, which occur in many verbal idioms, and uses a manually annotated corpus to evaluate its parsing strategy. Results show a precision of 0.92, a recall of 0.83, an F-measure of 0.87, and an accuracy of 0.99.
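As a quick consistency check on the reported scores, the sketch below recomputes the F-measure from the stated precision and recall. It assumes the F-measure is the balanced F1 (the harmonic mean of precision and recall), which the abstract does not state explicitly; the function name `f_measure` is illustrative and not taken from the paper.

```python
def f_measure(precision: float, recall: float, beta: float = 1.0) -> float:
    """Return the F-beta score; beta=1 gives the balanced harmonic mean (F1)."""
    if precision == 0.0 and recall == 0.0:
        return 0.0  # avoid division by zero when both scores are zero
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Precision and recall as reported in the abstract.
precision, recall = 0.92, 0.83

# Assumption: the reported F-measure is F1 (beta = 1).
f1 = f_measure(precision, recall)
print(f"F1 = {f1:.2f}")  # prints "F1 = 0.87"
```

Under that assumption, the recomputed value rounds to the reported 0.87. The accuracy of 0.99 cannot be rechecked this way, since it depends on the number of true negatives, which the abstract does not give.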
Similar resources
COST Action IC1207 PARSEME meeting
Dealing with idioms in Natural Language Processing systems is difficult, among other reasons, because their architecture must be conceived in such a way that it does not preclude the processing of both free word combinations and these more constrained expressions. On the other hand, many idioms do have syntactic structure, and can undergo several types of formal variation, thus making them h...
On the Effects of Linguistic, Verbal, and Visual Mnemonics on Idioms Learning
Finding more effective ways of teaching second language idioms has been a long-standing concern of many teaching practitioners and researchers. This study was an endeavor to explore the effects of three linguistic mnemonic devices (etymological elaboration, keyword method, and translation) on EFL learners’ recognition and recall of English idioms. To achieve the purpose of the study, ninety male...
Reusable Lexical Representations for Idioms
In this paper I introduce (1) a technically simple and highly theory-independent way for lexically representing flexible idiomatic expressions, and (2) a procedure to incorporate these lexical representations in a wide variety of NLP systems. The method is based on Structural EQuivalence Classes for Idioms and therefore called the SEQCI method. I illustrate the approach using the Rosetta MT sys...
Implementing European Portuguese Verbal Idioms in a Natural Language Processing System
This paper is based on an extant lexicon-grammar of European Portuguese verbal idioms (e.g., deitar mãos à obra, literally, 'to throw hands to the work', 'to start working'). This is a database containing about 2,400 expressions, along with all relevant information on the sentence structure, distributional constraints and transformational properties of these frozen sentences. In this paper, we pr...
Elaborating the parameterized Equivalence Class Method for Dutch
This paper discusses the parameterized Equivalence Class Method for Dutch, an approach developed to incorporate standard lexical representations for Dutch idioms into the representations required by any specific NLP system with as little manual work as possible. The purpose of the paper is to give an overview of parameters applicable to Dutch, which are determined by examining a large set of data ...
Publication date: 2014